Selective Sampling on Probabilistic Data
نویسندگان
چکیده
In the literature of supervised learning, most existing studies assume that the labels provided by the labelers are deterministic, which may introduce noise easily in many real-world applications. In many applications like crowdsourcing, however, many labelers may simultaneously label the same group of instances and thus the label of each instance is associated with a probability. Motivated by this observation, we propose a new framework where each label is enriched with a probability. In this paper, we study an interactive sampling strategy, namely, selective sampling, in which each selected instance is labeled with a probability.Specifically, we flip a coin every time when we read a new instance and decide whether it should be labeled according to the flipping result. We prove that in our setting the label complexity can be reduced dramatically. Finally, we conducted comprehensive experiments in order to verify the effectiveness of our proposed labeling framework.
منابع مشابه
Learning Probabilistic Linear-Threshold Classifiers via Selective Sampling
In this paper we investigate selective sampling, a learning model where the learner observes a sequence of i.i.d. unlabeled instances each time deciding whether to query the label of the current instance. We assume that labels are binary and stochastically related to instances via a linear probabilistic function whose coefficients are arbitrary and unknown. We then introduce a new selective sam...
متن کاملEnergy-Aware Data Collection in Sensor Networks: A Localized Selective Sampling Approach
One of the most prominent and comprehensive ways of data collection in sensor networks is to periodically extract raw sensor readings. This way of data collection enables complex analysis of data, which may not be possible with in-network aggregation or query processing. However, this flexibility in data analysis comes at the cost of power consumption. In this paper, we introduce selective samp...
متن کاملA Probabilistic Search Algorithm for Finding Optimally Directed Solutions
A new global search technique called "Probabilistic Global SearchLausanne" (PGSL), is presented in this paper. The technique is based on selective sampling of the search space according to a probability distribution function. The performance of the technique is compared with genetic algorithms using non-linear benchmark problems involving a large number of variables. For most of these problems,...
متن کاملPhoton dosimetry based on selective data sampling for the NaI(TL) detector
Radiation detection is essential for determining of radiation dose. Depend on the detector and dosimetry method, detection process is performed in different levels. Pulse counting is the first level of detection. Typically, the output of a radiation detector for determining value of the radiation dose cannot be used directly. Through changing the response function or the readout detector, is tr...
متن کاملAttention as inference: selection is probabilistic; responses are all-or-none samples.
Theories of probabilistic cognition postulate that internal representations are made up of multiple simultaneously held hypotheses, each with its own probability of being correct (henceforth, "probability distributions"). However, subjects make discrete responses and report the phenomenal contents of their mind to be all-or-none states rather than graded probabilities. How can these 2 positions...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2014